A System for Keyword Search on Probability XML Data

نویسندگان

  • Weidong Yang
  • Hao Zhu
  • Zheng Zheng
  • Huirong Chen
  • Lei Wang
چکیده

Many probabilistic XML data models have been proposed to store XML data with uncertainty information, and based on them the issues such as structured querying are extensively studied. As an alternative to structured querying, keyword search in probabilistic XML data needs to be concerned. In this paper we addressed the issue of keyword search on probabilistic XML data. The probabilistic XML data is viewed as a labeled tree, and a concept of Minimum Meaningful Fragment (MMF) is defined as the searching result. A MMF is a minimum subtree of the probabilistic XML data which has a positive probability of containing all keywords. To sort the MMFs a novel scoring function mainly considering the degree of uncertainty information is presented. We propose a system to compute top-k searching results efficiently based on the scoring function. The experiments shows the efficiency for our system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A System for Keyword Proximity Search on XML Databases

Keyword proximity search is a user-friendly information discovery technique that has been extensively studied for text documents. In extending this technique to structured databases, recent works [6, 7, 4, 2] provide keyword proximity search on labeled graphs. A keyword proximity search does not require the user to know the structure of the graph, the role of the objects containing the keywords...

متن کامل

Ontology-Driven Keyword Search for Heterogeneous XML Data Sources

Massive heterogeneous XML data sources emerge on the Internet nowadays. These data sources are generally autonomous and provide search interfaces of XML query language such as XPath or XQuery. Accordingly, users need to learn complex syntaxes and know the schemas. Keyword Search is a user-friendly information discovery technique, which can assist users in obtaining useful information convenient...

متن کامل

Interactive Fuzzy based Search over XML Data for Optimized Performance

In a traditional keyword-search system over XML data, a user composes a keyword query, submits it to the system, and retrieves relevant answers. In the case where the user has limited knowledge about the data, often the user feels “left in the dark” when issuing queries, and has to use a try-and-see approach for finding information. In this paper we study, TASX Type-Ahead Search in XML data, a ...

متن کامل

ICRA: Effective Semantics for Ranked XML Keyword Search

Keyword search is a user-friendly way to query XML databases. Most previous efforts in this area focus on keyword proximity search in XML based on either tree data model or graph (or digraph) data model. Tree data model for XML is generally simple and efficient for keyword proximity search. However, it cannot capture connections such as ID references in XML databases. In the contrast, technique...

متن کامل

Keyword Search on Both XML and Relational Data

Keyword search is a familiar and effective method to retrieve information under databases for any user. Keyword search on structured/semi-structured databases has received significant attention in recent years. A number of solutions have been proposed and many prototypes have been developed. Building on growing user needs, recently several RDBMS (IBM DB2, etc.) have made themselves be compatibl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013